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Abstract 



We study in which way Kolmogorov complexity and instance complexity 
affect properties of r.e. sets. We show that the well-known 21ogn upper bound 
on the Kolmogorov complexity of initial segments of r.e. sets is optimal and 
characterize the T-degrees of r.e. sets which attain this bound. The main part 
of the paper is concerned with instance complexity of r.e. sets. We construct 
a nonrecursive r.e. set with instance complexity logarithmic in the Kolmogorov 
complexity. This refutes a conjecture of Ko, Orponen, Schoning, and Watanabe. 
In the other extreme, we show that all wtt-complete set and all Q-complete sets 
have infinitely many hard instances. 

Key words: Kolmogorov complexity, instance complexity, recursively enumerable 
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AMS (MOS) subject classification: 03D15, 03D32, 68Q15. 

1 Introduction 

Intuitively, Kolmogorov complexity measures the "descriptional complexity" of a 
string x. It is defined as the length of the shortest program that computes x from the 
empty input. Accordingly, the Kolmogorov complexity of initial segments of a set A 
is considered as a measure of the "randomness" of A. It is well-known that for r.e. 
sets the Kolmogorov complexity of initial segments of length n is bounded by 2 logn. 
We show that this bound is optimal and characterize the Turing degrees of r.e. sets 
which attain this bound as the array nonrecursive degrees of Downey, Jockusch, and 
Stob g. 

Ko, Orponen, Schoning, and Watanabe J7|, 12] have recently introduced the notion 



of instance complexity as a measure of the complexity of individual instances of A. 
Informally, ic(x : A), the instance complexity of x with respect to A, is the length of 
the shortest total program which correctly computes Xa{%) anu does not make any 
mistakes on other inputs, but it is permitted to output "don't know" answers. It is 
easy to see that the Kolmogorov complexity of x is an upper bound for the instance 
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complexity of x (up to a constant). A set A has hard instances if for infinitely many x 
the instance complexity of x w.r.t. A is at least as high as the Kolmogorov complexity 
of x (up to a constant which may depend on A), i.e., the trivial upper bound is already 
optimal. 

Orponen et al. conjectured in [11, 12|] that every nonrecursive r.e. set has hard 



instances ("Instance Complexity Conjecture (ICC)"). Buhrmann and Orponen |2j 
proved ICC for m-complete sets. Tromp |L4j] proved that the instance complexity of x 
w.r.t. any nonrecursive set A is infinitely often at least logarithmic in the Kolmogorov 
complexity of x. We construct an r.e. nonrecursive set which attains this lower bound 
for all x. In particular, this is a counterexample to ICC. On the positive side, we show 
that ICC holds for wtt-complete sets, Q-complete sets, and hyperhypersimple sets. 
But ICC fails for a T-complete set, since it fails for an effectively simple set. However, 
ICC holds for all strongly effectively simple sets. We also investigate a weak version 
of instance complexity, where programs may not halt instead of giving "don't know" 
answers. 

The resource-bounded version of instance complexity is also well-studied; we refer 
the reader to @, |, §, [TJ. 

Notation and Definitions: 

The notation generally follows ||. For further recursion theoretic background we 
refer the reader to [lCj, For p G {0, 1}*, l(p) denotes the length of p; A is the 
empty string. We use the special symbol _L to denote the "don't know" output. 
Xa is the characteristic function of A. We identify M and {0, 1}* via the canonical 
correspondence as in [|[ p. 11]. 

Definition 1.1 (Chaitin, Kolmogorov, Solomonoff) 

For any partial recursive mapping U : {0,1}* x {0,1}* — > {0,1}* U {_L} and any 
x G {0, 1}* we define Cu{x) = min{/(p) : U(p, A) = x}, the Kolmogorov complexity of 
x in U . If no such p exists then Cjj(x) = oo. 

It is helpful to think of U as an interpreter which takes a program p and an input 
z and produces the output U (p, z) 

Instance complexity was introduced in [7j in order to study the complexity of single 
instances of a decision problem. 

Definition 1.2 (Ko, Orponen, Schoning, Watanabe, 1986) 

Let A C {0, 1}*. A function / : {0, 1}* -> {0, 1, _L} is called A-consistent if f(x) = 
Xa(%) V f(x) =_L, for all x G dom(f). The instance complexity of x with respect to 
A in U is defined as 

icu{x : A) = min{l(p) : Xz. U(p,z) is a total A-consistent function 

such that U(p,x) = xa(x)}. 

If no such p exists then icy[x : A) = oo. 

If we drop in the definition of ic\j the requirement that Xz. U(p, z) is total, then 
we obtain a weaker notion of instance complexity, which we denote by icjj(x : A). 
Note that icu{x : A) < ic\j{x : A) for all x, A. 
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It is well-known (see ||) that there exist "optimal" partial recursive functions U 
such that, for every partial recursive mapping U', there is a constant c with Cu(x) < 
Cu'(x) + c, icjj(x : A) < icjj>(x : A) + c, and zc[/(x : A) < icjj>(x : A) + c, for all x, A. 

For the following we fix an optimal mapping U and write C(x), ic(x : A), and 
ic(x : A), for Cu(x), icjj{x : A), and zc;y(x : A), respectively. We also write U s (p,z) 
for the result, if any, after s steps of computation of U with input (p, z). C s (x) denotes 
the approximation to C(x) after s steps of computation (i.e., with U s in place of U in 
the definition of C(xj). Clearly, C s+1 (x) < C s (x) and C t (x) = C(x) for all sufficiently 
large t. 

The instance complexity of x can be bounded by the Kolmogorov complexity of x 
in the sense that for every set A there is a constant c such that ic(x : A) < C(x) + c 
for all x. Informally, x is a hard instance of A if this upper bound is also a lower 
bound. This was the motivation for the following definition (which is independent of 
the choice of the optimal U) . 

Definition 1.3 (Ko, Orponen, Schoning, Watanabe, 1986) 
A set A has hard instances if there is a constant c such that 

ic(x : A) > C(x) — c for infinitely many x. 

If the condition holds with ic in place of ic, we say that A has hard instances with 
respect to ic. 

Remark: The difference between ic and ic is perhaps best explained by an example: 
Suppose that A is an r.e. set and we want to define a program p such that it 
witnesses ic(x : A) < \p\ for all x with C(x) < n. Since p has to be total we have 
to define it for every input z at some step s. If z has already appeared in A there is 
no problem, we set U(p,z) = 1. If z has not yet appeared in A and C s (z) > n, we 
could try to define U (p, z) =_L, but this can later become incorrect if it turns out that 
C(z) < n. If we set U(p, z) = and z later appears in A, then p is also incorrect. 

In the case of ic we have more freedom: We may leave U(p, z) undefined until 
C s (z) < n at some stage s. If this never happens, then U(p,z) is undefined and 
C(z) > n, which is fine. Still the second source of error remains: If C s (z) < n and z 
has not yet appeared in A at stage s, we have to define U(p, z), and the best we can 
do is to set U(p, z) = 0. But this may later turn out to be incorrect. 

2 A version of Barzdin's Lemma 

In this section we consider the Kolmogorov complexity of initial segments of r.e. sets. 
For A C Af and n G Af we write \a \ n for the string Xa(0) • • • Xa{k)- 

Let us first recall what was previously known. The conditional complexity of a 
string a of length n is defined as C(a\n) = min{/(p) : U(p, n) = cr}. We write C{xA\ n ) 
for C(xa t i n ~ Barzdin (Q, see || Theorem 2.18]) characterized the worst 

case of the conditional complexity for initial segments of r.e. sets: 

• For every r.e. set A there is a constant c such that for all n: 
C{xaW) < logn + c. 
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• There is an r.e. set A such that C(xA\n) > logn for all n. 

Now we look at the standard Kolmogorov complexity C (xa \ n ) ■ Utilizing a result 
of Meyer ||, p. 525], Chaitin proved that, if there is constant c such that for all n, 
C(xa t n ) — logn + c, then A is recursive [[| Theorem 6], [|, Exercise 2.43]. 

For every r.e. set A there is a constant c such that C(xa \ n) < 2 logn + c for 
all n (see ||, Exercise 2.59]). On the other hand, there is no r.e. set A such that 
C{xa \ n ) > 2 logn — 0(1) for almost all n. This follows from the argument in |8], 
Exercise 2.58]. 

In H, Exercise 2.59] it is stated as an open question (attributed to Solovay) whether 
the upper bound 2 logn is optimal. The following result shows that this is indeed the 
case. For ease of conversation, we say that A is complex if there is a constant c such 
that C (xa \ n) > 2 log n — c for infinitely many n G N '. 

Theorem 2.1 There is an r.e. complex set. 

Proof: Let t = 0,t k+ i = 2* fc , and I k = (t k ,t k+1 ], for all k > 0. (Ik) is a sequence 
of exponentially increasing half-open intervals. 

Let f(k) = £fc£+i(* -t k + 1), g(k) = max{Z : 2 l+l - 1 < f(k)}. Note that 
f( k ) = - (!) an d 9( k ) = 2\ogt k+1 - 2 - o(l), for k -> oo. 

We enumerate an r.e. set A in steps as follows: 

Step 0: Let A = 0. 

S^ep s + 1: Let A s+1 = A s . For k — 0, . . . , s do: If C s (xa s \ n ) < g{k) for all n <E I k 
then enumerate min(yl s D Jfc) into A s+ i. □ 

Let A = U s >qA s . Suppose for a contradiction that C(xa \ n) < g{k) for all n G I k . 
Then we eventually enumerate every n £ I k into A. Note that for fixed n there are 
at least n — t k + 1 different strings cr = xa s t n with 1(a) = n + 1 and C(cr) < ^(/c). 
(The suffix of xa s \ n runs through i :r o ri ~ ifc_:c for x — 0, . . . , n — t k .) Thus, there are 
at least f(k) many different strings which all have Kolmogorov complexity at most 
g(k). This contradicts the definition of g(k). 

So for every k there exists n G I k with C(xa \ n ) > g(k), i.e., C(xa \ n ) > g(k) > 
2 logn — 2 — o(l). Thus, A is complex. | 

We now characterize the degrees of r.e. complex sets. Downey, Jockusch, and 
Stob [|j] introduced the notion of an array nonrecursive set. This captures precisely 
those r.e. sets that arise in multiple permitting arguments. In |4[] several other natural 
characterizations of this degree class are given. 

An r.e. set A is called array nonrecursive with respect to {F k } k& _\f if 

(ye)(3°°k)\W e nF k = AnF k ]. 

Here {F k } k( zjy denotes a very strong array. This means that {F k } k( zj^ is a strong array 
of pairwise disjoint sets which partition M and satisfy|Ffc| < for all k G M . 

An r.e. set is array nonrecursive if it is array nonrecursive with respect to some 
very strong array {F k } ke ^. A degree is called array nonrecursive if it contains an r.e. 
array nonrecursive set. Not every r.e. nonrecursive degree is array nonrecursive @, 
Theorem 2.10]. 
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Theorem 2.2 The degrees containing an r.e. complex set coincide with the array 
nonrecursive degrees. In addition, if A is r.e. and not of array nonrecursive degree, 
then for every unbounded, nondecreasing, total recursive function f there is a constant 
c such that 

C(xa \ n) < logn + f{n) + c for all n £ Af. 

Proof: Note that, in order to make A complex, we only need to complete the 
construction from the previous theorem for infinitely many intervals. It follows that 
every r.e. set A, that is array nonrecursive with respect to {Ik}kejv, is also complex. 
In j|, Theorem 2.5] it is shown that every array nonrecursive degree contains such a 
set, i.e., it contains an r.e. complex set. 

For the converse we use f|, Theorem 4.1]. It states that if A is r.e. and does 
not have array nonrecursive degree, then for every total function g A there is a 
total recursive approximation g(x, s) such that \im s g(x, s) = g(x) and \{s : g(x, s) ^ 
g(x, s + 1)}| < x, for all x £ Af. Actually, in [|J this is only stated for 0/1-valued g, 
but the proof provides the more general version. 

Let A be r.e. and not of array nonrecursive degree. Assume we are given any total 
recursive, nondecreasing, unbounded function /. Let m(x) = 1 + max{n : f{n) < x}; 
m is total recursive. Let g(x) = xa \ m(x). Since g is recursive in A, there is a total 
recursive approximation g(x, s) as above. 

How can we describe xa \ n ? Given n we compute n' = min{x : m{x) > n}. 
Then we simulate g(n',s) until it outputs g(n'), which gives us xa \ n. In order to 
perform the simulation we only need to know the exact number x < n' of mindchanges 
of g(n',s). Thus, xa I n is specified by the pair (x,n r ) which can be encoded by a 
string of length logn + 2 log(a; + 1) +0(1). Since m(x — 1) < n we have f(n) > x, by 
the definition of m. Thus, we get 

C(xa \ n) < \ogn + 21og(x + 1) + 0(1) < \ogn + f(n) + 0(1). 

This completes the proof of the theorem. | 

Note that Theorem |2.2| entails the following curious gap phenomenon. For every 
r.e. degree a there are only two cases: 

(1) There is an r.e. set A £ a such that 
(3°°n)[C(xA \ n) > 21ogn- 0(1)]. 

(2) There is no r.e. set A £ a and e > such that 
(3°°n)[C( X A \n) > (1 + e) hgn - 0(1)]. 

3 The Instance Complexity Conjecture fails 

In this section we determine the least possible instance complexity of nonrecursive r.e. 
sets. Here it is convenient to take A as a subset of {0, 1}*. Clearly, if A is recursive 
then ic(x : A) is bounded by a constant for all x. The next result (another gap 
theorem) shows that, for infinitely many x, ic(x : A) must be at least logarithmic in 
C (x) if A is nonrecursive^. 

1 This result was previously announced by Tromp Jl4fl . 
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Theorem 3.1 If ic(x : A) < logC(x) — 1 for almost all x, then A is recursive. 

Proof: Let P k = {0, l}- fc and let V(P k ) denote the set of all subsets of Pk- Uni- 
formly in k we enumerate a finite set B k C {0, 1}*. 

Step 0: Let S k = V(P k ) and B k = 0. 

Step n + l: Search via dovetailing for / G Sk, x G {0, 1}* and s G Af such that 
U s (p, x) =_L for all p G I. If such an I is found, then enumerate x into remove I 
from S k , and go to step n + 2. □ 

Note that is nonempty, since 7 = trivially satisfies the condition for all x, s. 
Also, at most \V(P k )\ = 2 |Pfc| elements are enumerated into B k and is uniformly 
r.e. Thus, there is a partial recursive function if) : {0, 1}* x {0, 1}* — > {0, 1}* such that 
V>({0, l} |Pfcl , A) = B k for all jfe. In particular, C^(x) < \P k \ = 2 k+1 - 1 for all x G B k . 
Choose a constant c such that C{x) < C^(x) + c for all x. 

Let A C {0, 1}* be given and suppose that ic(x : A) < logC(x) — 1 for almost all 
x. Then ic(x : A) < log(C^,(x) + c) — 1, so ic(x : A) < logC^(x) for almost all x. 
Since |fog C</,(a;)J < [log(2 fc+1 — 1)J — k for all x G -Bfc, we can choose k large enough 
such that for all x G B k we have ic(x : A) < A;. 

Thus, for each x & B k there is p G such that A^. U (p, z) is a total A-consistent 
function with U(p,x) = Xa(x). Let I = {p G P k : A2;. U{p,z) is a total A-consistent 
function}. This set is nonempty since B k is nonempty. 

Now consider the construction of B k . Note that Iq cannot be removed from Sk- 
Otherwise there exists x G B k such that C^(x) < 2 h+1 — 1 and U{p,x) =_L for all 
p G Iq, i.e., ic(x : A) > k, contradicting the choice of k. Since Jo is never removed 
from Sk, it follows from the construction of B k that for every x there is p G Jo with 
U(p,x) = Xa( x )- Thus, if we amalgamate all of the functions U{p, — ) with p G Jo? we 
get a recursive characteristic function of A, i.e., A is recursive. | 



We prove that the lower bound of Theorem |3J] is tight even for nonrecursive r.e. 



sets. This refutes the Instance Complexity Conjecture of Orponen et al. [ IT] , [12 
H Exercise 7.41], stating that every nonrecursive r.e. set has hard instances. In 
contrast, our result together with Theorem EO shows that the true threshold between 



the instance complexity of recursive and nonrecursive sets is logC(a;) instead of C(x). 

Theorem 3.2 There is a nonrecursive r.e. set A and a constant c such that 

ic{x : A) < logC(x) + c for all x. 

Proof: It suffices to construct a nonrecursive r.e. set A and a partial recursive 
function if) such that ic^(x : A) < logC(x) + 2 for almost all x. In the following we 
write if> p for \z.ip(p, z). 

Let E k = {x : C{x) < 2 k — 2} for k > 1. We want to establish that ic^(x : A) < k 
for all x G E k . Let M k = {p kj i, . . . ,p k ,2 k -2} denote the set of the first 2 k — 2 strings 
of length k. The idea is that every if) Pk . is A-consistent and for each 16^ there is 
p G M k such that ip p witnesses that ic^(x \ A) <k. There is, however, some difficulty 
to combine this with the requirement to make A nonrecursive. 

The basic idea to satisfy the latter requirement is as follows: For each e > 1 we 
establish a unique diagonalization value d e , then we wait until d e is enumerated into 
W e , if this ever happens we enumerate d e into A. Here {W e } ee _tf is the standard r.e. 
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listing of all r.e. sets of strings. Hence, this strategy makes sure that A is not r.e., so 
A is a nonrecursive r.e. set. 

Suppose that d e appears in Ek before it appears in W e . If we define ip Pk .(d e ) = 
for some i, then, since ip Pki should be A-consistent, we can no longer enumerate d e 
into A. This threatens our diagonalization strategy. On the other hand, we certainly 
should make sure that ic^(d e : A) < k. 
This conflict is solved by a finite-injury priority argument: 

If e > k and we are forced to define ip Pki (d e ) = 0, then we assign a new much larger 
value to d e and try to diagonalize at this new value. Note that d e is changed only 
finitely often, because there are only finitely many values which may appear in Ek for 
some k < e. Thus, the value of d e eventually stabilizes and the e-th diagonalization 
strategy goes through with this final value. 

If e < k then we do not use ip Pk i to ensure that ic^(d e : A) < k. Thus, we define 
if) Pkti (d e ) =_L, which certainly maintains the A-consistency. Instead we will have two 
special programs r ej i, r e)2 of length e (which are not in M e ; this is the reason why we 
have left out two strings) to witness that ic^(d e : A) < e < k. More precisely, if the 
final <i e -value is not enumerated into A, then ip Te l will be the correct function. If the 
final <i e -value is enumerated into A, then ip TeA will not be A-consistent but i]) T 2 is 
used as a back-up function. 

It remains to explain how only \Mk\ many programs can take care of all of the elements 
in Ek, which may be up to 2' Mfc ' — 1 many. We show in an example how two programs 
Pi,P2 can take care of 3 = 2 2 — 1 elements (for simplicity, we drop the distinction 
between numbers and strings): At the beginning ip pi1 ip P2 are undefined. Now in step 
Si the first element x\ < S\ appears. We let ip pi (x) = Xa{%) f° r an x < s i- 111 the 
following steps s we define ip pi {s) =_L until the second element x 2 appears, say at 
step s 2 > #2- If x 2 < Si we do nothing. If x 2 > Si then we define ip P2 {x) = Xa(%) 
for all x < s 2 and in the following steps t we define ip P2 {t) =_L. The point is that 
ip P2 also takes care of xi, thus we suspend the definition of ip Pl until a third element 
x s appears at step s 3 > x 3 . If x 3 > s 2 then we resume the definition of ip Pl and let 
ip Pl (x) = Xa{ x ) f° r all s 2 < x < S3. For arguments t > S3 we define both function 
equal to _L. Note that now p\ and p 2 together take care of X\,x 2 ,x^. 

This idea is easily generalized: Let succ(a) denote the lexicographical successor of 
a, i.e., if a — b\ . . . b n ^ l n then succ(cr) = i_1 16j + i . . . b n where i = min{j : bj = 0}. 
Then the programs pk,i G M k with s-ucc^(0' Mfe ')(i) = 1 take care, if exactly m 
elements are enumerated into (In the implementation below we count only those 
elements which are not <i e -values for some e < k.) Note that, since m < 2' Mfc ' — 1, 
succ is never applied to ll Mfc l. 

We now turn to the detailed implementation. First we fix some additional notation 
and conventions. Let (— , — ) denote a recursive pairing function which is increasing 
in its second argument. We assume that elements of Ek are enumerated in steps such 
that in each step at most one new element is enumerated; also if x is enumerated in 
step s then l(x) < s. W CjS is the finite set of strings which are enumerated into W e in 
at most s steps of computation. 

In the construction the variables e, i, j, k, n, s, t denote numbers, and p, x, z denote 
strings. In addition, the following variables are used: ip PyS the finite portion of ip p 
constructed up to stage s; the i-th bit of <7 kyS £ {0, l}' Mfc ' tells us whether ip Pk . is 
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currently assigned to take care of the elements in E k ] len(k, s) is the greatest length 
n such that our set-up at stage s guarantees that ic^(x : A) < logC s (x) + 2 for all 
x G Ek, s with l(x) < n; d e (s) is the current value of the e-th diagonalization point. 
We call e "active" as long as no <i e -value has been enumerated into A, otherwise we 
call e "passive". So, if e is "passive", then we know that we have explicitly satisfied 
the e-th diagonalization requirement. A s denotes the finite set of elements which have 
been enumerated into A up to stage s. 

Let R(k, s) = {d e (s') : e < k A s' < s}. As explained above, the programs in M k 
do not need to take care of the elements in R(k, s). 

If one of the variables v(s) is not explicitly changed at stage s + 1, then we assume 
without further mentioning that v(s + 1) = v(s). 

We first describe the construction of ip p for p G M k , k > 1. Then we define ip p for 
the two special values p = r e; i, r 6i2 of each length e. 

Construction: 

Stage 0: Let tp p = Xx. |, for all p E {0, 1}*. For all k > 1: a kfi = 0^;len(k, 0) = 0; 

4(0) = 0< fe '°>; declare k as "active". Let A = 0. 
Stage s + 1: 
Case I: s is even. 

For e = 0, . . . , s: If e is active and d e (s) G W e>s — A s , then enumerate d e (s) into A 
and declare e "passive" . 
Case II: s is odd, s = 2(k,t) + 1. 

Let ^x) =_L, for all % with o~k, s {i) = 1 and all x with l(x) = t. 
If a new element x, /(x) < t, enters E k after exactly t steps, then act 
according to the following cases: 

a. ) If x G R(k, s) or l(x) < len(k, s) then go to stage s + 2. 

b. ) Otherwise do the following: 

Let ak,s+i = succ(ak,s) and i = min{j : (Tk,s+i(j) — !}• (At at most 2l Mfc l — 1 
elements are enumerated in E k , so we get o"fc,s+i G {0, l}l Mfc l — {0' Mfe '}.) 
Let n = min{/(z) : z G" dom(ip Pki )}. 

Define tp Pki (z) = xa 3 ( z ), for all z G" -R(/c, s) such that n < l(z) < t. 
Let i/j pki ( z ) = -L> fo r all -2 G R(k, s) such that n < l(z) < t. 
Let len(k, s + 1) = t + 1. For all active e > fc, let 4(s + 1) = <e ' s+1) . 
Go to stage s + 2. □ 

For each e > 1 we define 



If e is active at all stages then let if) T 2 = Ax. |. Otherwise let s e be the (unique) 




stage where e is declared "passive" and let 




End of Construction. 
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Verification: 

Most of the following claims are standard. The crucial one is Claim 3, b.), a). 
Claim 1 For all e > 1: 

a. ) l(d e ) is nondecreasing, and for all s: If d e (s) ^ d e (s + 1) then l(d e (s + 1)) > s. 

b. ) range(d e ) is a uniformly recursive finite set; range(d e ) C)range(d e ') = 

for all e' ^ e. 

c. ) If A fl range(d e ) contains an element x then x = lim^oo d e (s). 

d. ) For all x, s: If l(x) < s and, for all e, x ^ d e (s), then x ^ d e (s') for all e 

and all s' > s. 

e. ) A is r.e. and nonrecursive. 

Proof: a.) If d e (s) ^ d e (s + 1) then for some s' < s: l(d e (s)) = (e, s') < (e, s + 1) = 
l(d e (s+l)). Note that (e, s + 1) > s, since (— , — ) is monotone in the second argument. 

b. ) It follows from a.) that range(d e ) is uniformly recursive. It is a finite set, 
because d e (s) changes only if a new element is enumerated in some set Ek, k < e 
which happens only finitely often. So lim^oo d e (s) exists and is finite. range(d e ) and 
range(d e >) are disjoint for e ^ e', since (— , — ) is injective. 

c. ) If d e (s) is enumerated into A at stage then e is declared "passive", so 
d e (s) is fixed at all later stages. 

d. ) This follows from a.). 

e. ) Clearly A is r.e. Suppose for a contradiction that A is recursive. Then there 
exists e with A = W e . By a.), b.), there is a stage s such that d e (s') = d e (s) for all 
s' > s. By construction, d e (s) is enumerated into A iff it is enumerated into W e . This 
contradicts the hypothesis A = W e . □ 

Claim 2 For all e > 1: 

a. ) Vv e i , Vv e 2 are uniformly partial recursive. 

b. ) If e is always "active" then ip Te l witnesses that ic^(x : A) < e for all 

x G range(d e ). 

c. ) If e is eventually "passive" then ip Te2 witnesses that ic^[x : A) < e for all 

x G range(d e ). 

Proof: a.) This follows from Claim 1, b.) 

b. ) If e is always "active" then range(d e ) fl A = 0, thus tp Te l is A-consistent and 
i^ Te l = = Xa(x) for all x G range(d e ). 

c. ) If e is declared "passive" at stage s + 1 then A fl range(d e ) = {d e (s)}. Thus 
ip Tea is A-consistent and VVe^W = Xa( x ) for all x G range(d e ). □ 

Let ^ denote the finite portion of i\) e defined at the end of stage s. 
Claim 3 For all s = 2(k,t) + l: 

a. ) For alH, 1 < % < \M k \: ip s Vki is an A-consistent function. 

b. ) For alH, 1 < % < \M k \: If a Ks+1 {i) = 1 then dom{ip s +]) = {x : l(x) < t}. 

c. ) For all x, l(x) < len(k, s + 1): If x £ R(k, s) then there exists i, 1 < % < \M k \, 

with a Ks+l (i) = 1 and %^{x) = Xa s+1 {x). 
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Proof: a.) We use Claim 1, d.) and the fact that i> Pki is defined at stage s + 1 only 
for arguments less than s. If e < k and ip Pk .(d e (s')) is defined then ip Pk .(d e (s')) =_L, 
so there is no problem with consistency. If e > k and ip Pk .(d e (s')) = XA a+1 (d e (s')) 
is defined at stage s + 1 > s', then either e is already "passive", so XA s+ i(d e (s')) = 
XA{d e (s')), or e is "active" and we define d e (s + l) at stage s+1 such that /(<i e (s + l)) > 
l(d e (s')). In the latter case we get ijj pk>i {d e {s')) = XA 3+1 (d e (s')) = XA{d e (s')) = 0. 
b.) and c.) are shown by induction on s. Consider stage s + 1 = 2(k,t) + 2. If no 
new element is enumerated in E k after exactly t steps then a k)S +i = o~k,s an d b.), c.) 
follow from the induction hypothesis and the definition of ip Pk . at stage s + 1. 

Now assume that x enters E k after exactly t steps. If case a.) occurs, the claim 
follows from the induction hypothesis. If case b.) occurs, we have x G" R(k, s) and 
len(k,s) < l(x) < t. We have o~k, s +i = succ(ak,s), so ak, s +\{i') = cr^st^') for all 
i! > i = mm{j : a k>s +i(j) = 1}. 

If cr/ CjS+ i(i / ) = for all i' > i then s + 1 is the first stage s' where ak, s '{i) = 1- This 
means that ip Pki — Ax. j and ipp^(z) = Xa s (z) = XA s+1 (z), for all z such that l(z) < t 
and z ^ R(k, s). 

If there is i f > i with = 1, then there exists a greatest stage s' < s with 

o~k,s'(i) = 1 A cr fc)S / +1 (i) = 0. Then we have Ck^'+iii 1 ) = o"/fc,s+i(^) for all i' > i and we 
have crfc >s / + i(i / ) = o-k, s +i{i') = for all i' < i. By induction hypothesis, we get for all 
x with l(x) < len(k, s' + 1): If x ^ R(k, s') then there exists j with ak, s '+i{j) = 1 and 
ipp+^x) = XA a , +1 (x) = Xa( x ) (the second equality holds by part a.)). 

Since R(k, s') C R(k, s), it only remains to consider x with len(k, s' + 1) < /(x) < 
len(k,s + 1) = t + 1. As Ofcy(i) = 1 it follows, by induction hypothesis, that 
dom(ipp k .) = dom(ipp ki ) = {x : l(x) < len(k,s' + 1)}. Thus, n = min{/(,2) : z ^ 
dom{%ljp ki )} = len(k,s' + 1) and at stage s + 1 we define ip^(z) = Xa( z ), for all 
z ^ R(k, s) such that len(k, s' + 1) = n < l(z) < t + 1 = len(k, s + 1). For z e R(k, s) 
and l(z) < t we have ip^ l {z) =+. This completes the proof of b.), c). □ 

Claim 4 For almost all : A) < logC(x) +2. 

Proof: Let k > 1 be minimal such that x G E^. If x G R(k, s) for some s then, by 
Claim 2, we get zc^(x : A) < k. If x G" R(k,s) for all s then let = lim^oo cr^. 
By Claim 3, there exists i, 1 < i < \M k \ such that 0^(2) = 1 A ip Pki {x) = Xa(%)- 
Furthermore, ip Pki is total recursive and A-consistent, so %c^{x : A) < k. Since 
= 0, we have k > 1 and x £ E k _ u so 2 fc ^ x -2 < C(x), i.e., < log(C(x) +2) + 1 < 
logC(x) + 2 for all x with C(x) > 2. □ I 

What happens for ic ? Of course, the instance complexity conjecture also fails for ic. 



It even fails in a much stronger way, because, in contrast to Theorem 3+, ic can be 
arbitrary small, as we now show. 

Theorem 3.3 For every recursive function f there is an r.e. nonrecursive set A such 
that 

f(ic(x : A)) < C(x) for almost all x. 

Proof sketch: We may assume that / is strictly increasing. As above it suffices to 
define a partial recursive function ip(p,x) such that f{ic.,p{x : A)) < C(x) for almost 
all x and A is nonrecursive. This leads to the following requirements for all i > 1: 
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(Ni) (Vx)[C(x) < f(i + 1) (3p G {0, 1}*)[xa extends ^ and ^ p (x) = Xa{x)}}. 
(PA ^ ^ A. 

These can be satisfied by an easy finite-injury construction. Fix an enumeration of 
E i = {x: C(x) < f(i + 1)} for all i. 

During the construction we have for each i a current pi G {0, l} 1 which satisfies 
(N^ for all x that have been currently enumerated into A. If some x with ip Pi (x) = 
is later enumerated into A, then ip p . is no longer A-consistent and we have to choose 
a new p^. Since we have 2 l candidates for p i7 we can afford 2 l — 1 injuries. 

Therefore, we allow to enumerate a diagonalization witness x into A at stage s for 
the sake of (Pi), only if x has not yet appeared in any Ej with j < i. Clearly, (Pi) 
can still be satisfied. Furthermore, (Ni) is injured at most % times. Since i < 2 l — 1 
for all i > 1, every (iV,) will be eventually satisfied. | 

Remark: In the course of the construction at most 2^ +1 ) — 1 elements are not allowed 
to be enumerated into A by (Pi). Hence, we can fix in advance a set Jj of 2^* +1 ) 
witnesses for (Pi) and guarantee that one of them will be successful. Therefore, we 
can also modify the construction and satisfy the following requirements (P[) instead 
of (Pi) for any fixed r.e. set B 

(Pi) ieB J t n A ^ 0. 

Then we get B <d A. If we choose B = K, this shows that there is a d-complete set 
which satisfies the condition of the theorem. Since we need to enumerate at most one 
element of Jj into A, we get that A < w ttm B. Thus, every r.e. wtt-degree contains a 
set A as in the theorem. It can be shown that this does not hold for r.e. tt-degrees. 

4 R.e. sets having hard instances 

While we have shown in the last section that ICC fails for some nonrecursive r.e. sets, 
it is interesting to find out whether there are properties of r.e. sets which imply the 
existence of hard instances. We consider this question for classes of complete sets 
and of simple sets. Indeed, in most cases it turns out that such sets must have hard 
instances, which is a partial resurrection of ICC. 

Buhrman and Orponen ||, || Exercise 7.40] proved that the set of all random 
strings R = {x : C(x) > l(x)} satisfies ic(x : R) > l(x) — 0(l) for all x G R. (Actually, 
their result also holds for ic instead of ic.) Using the observation 

(*) If A < rn B via /, then ic(x : A) < ic(f(x) : B) + 0(1) for all x. 

and the fact that R is co-r.e., they conclude that every m-complete set A has hard 
instances in its complement. They asked whether the hard instances can be chosen 
from A instead of A. (This is of course impossible in the ic- version.) The next result 
gives a positive answer. 

Theorem 4.1 There is an r.e. set A with ic(x : A) > l(x) for infinitely many x G A. 
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Proof: Uniformly in n we enumerate A fl {0, 1}™ as follows: Let x±, . . . , be a 
listing of all strings of length n in lexicographical order. 

Step 0: Enumerate Xi into A, let i = 1, / = {0, l}^ 1 , J = {1, . . . , 2"} - {1}. 
.S^ep s + 1: If there is a string p G / such that 

(a) U s (p,Xj) G {0, 1} for some j G J, or 

(b) U s (p,Xj) =_L for all j G J, 

then choose the least such p, let J = / — {jo}, and do the following: 

In ): Enumerate Xj into A iff U s (p,Xj) = 0. Let J = J — {j}. 

In case (b): Let « = min(J). Enumerate Xi into A and let J = J — {i}. □ 

At the end of Step we have |/| = |J| = 2™ — 1. In all later steps an element of 
/ is removed iff an element of J is removed. Thus, at the end of each step we have 
|/| = \J\. Also, if case (b) occurs then min(J) exists (since at that point \J\ > 0). 
Note that the value of XA(xj) is fixed when j is removed from J. 

Let io, Iq, Jo be the final values of i, I, J in the above construction and choose so 
such that i — i , 1 — I , J — J in all steps t > s . Suppose for a contradiction that 
ic(x io : A) < n via p G {0, 

If p Io then there is a stage s < sq when p was removed from /. If p was 
removed in case (a) via j, then U(p,Xj) ^ XA{xj)- If P was removed in case (b) then 
U(p,Xi ) =_L. Hence, p does not witness that ic(xi : A) < n, a contradiction. 

If p G I then | J | = |/ | > 1 and there is t > s such that U t (p,x) G {0, 1, _L} for 
all x G J - Hence, at stage t + 1 either case (a) or case (b) occurs and |/ | decreases, 
contradicting the choice of So- 

Thus, we have ic(xi : A) > n = l(xi ) and clearly x io G A. Since this holds for all 
n, the theorem is proved. | 

Using (*) we get the following corollary. 

Corollary 4.2 For every m-complete set A there is a constant c such that 
ic(x : A) > C{x) — c for infinitely many x G A. 

This result also holds for a much weaker reducibility, as we now show. 

Theorem 4.3 For every wtt- complete set A there is a constant c such that 
ic(x : A) > C(x) — c for infinitely many x G A. 

Proof: Suppose that A is a wtt-complete set. We enumerate an auxiliary r.e. set 
B and a uniformly r.e. sequence {E n } n( zj^ with \E n \ < 2 n . Then there is a partial 
recursive function ip : {0,1}* x {0,1}* — > J\f such that ^({0, l} n , A) = E n . Hence, 
C^{x) < n for all x G E n and there is a constant c, independent of n, such that 
C(x) < n + c for all x G E n . Thus, it suffices to satisfy the following requirement for 
all n 

(R n ) (3x G E n n A)[ic(x : A) > n - 1]. 

By the recursion theorem and the fact that A is wtt-complete, we can assume that we 
are given in advance the index of a wtt-reduction from B to A, i.e., a Turing reduction 
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$ and a total recursive use-bound g such that, for all x, Xb(x) = and in the 

computation of every query is less than g(x). 

Each (R n ) is satisfied independently from the other requirements; so for the fol- 
lowing fix n and let x\ = (n, 1), . . . , £2™ = (n, 2 n ), m = max{g(xi) : 1 < i < 2™}, and 
I = {p : l(p) < n — 1}. We enumerate E n and B D {xi, . . . , x 2 ™} in steps i = 0, . . . ,2 n 
as follows: 

Step 0: Let s = 0,E n = 0. 

Step i + 1 : Search for the least s > s , such that 

(1) $^ s (x,) = 1 with use less than g(xj) for j — 1, . . . ,i and $^ s (xj) = with use 
less than g(xj) for j = i + 1, . . . , 2 n . 

(2) For each x E E n there is p G / such that 

(2.1) £7 s (p, z) is defined for all z < m. 

(2.2) (Vz<m)[?7 s (p,z)^J_^ U s ( P ,z)=xaM}- 

(2.3) C/ S (p,a;) = 1. 

Let Si + \ = s. Enumerate Xi + \ into B and compute some x < m with x E A — A Si+1 . 
(Note that x exists because otherwise $ A (xj + i) = 0^1 = XB(xi+i)- We can find a; 
by enumerating A.) Let CONS be the set of all p G / which satisfy conditions (2.1) 
and (2.2) for s = s i+ i. If U s (p,x) =_L for all p G CONS, then enumerate x into E n . 
Goto step i + 2. □ 

By construction, we have E n C A. We want to argue that in some step of the 
construction the search does not terminate. Since Xb(x) = & A (x), this can only 
happen if condition (2) is not satisfied for any sufficiently large s. But this means 
that ic(x : A) > n — 1 for some x G E n . 

Consider the value of CONS C I after each terminating step: We show that a new 
element enters CONS or an element is removed forever from CONS. Since there are 
at most |/| < 2 n_1 strings which may at some point become a member of CONS, it 
follows that there are less than 2 • 2 n ~ 1 = 2™ terminating steps, which completes the 
proof. 

Note that if a string p is removed from CONS at some stage s, then there is x 
such that U s (p,x) = and XaX x ) = 1- Thus, x cannot enter CONS again at any 
later stage. 

Suppose that step i + 1 terminates and consider the current value of CONS and 
of x at the end of this step. There are two cases: 

(a) U s (p,x) =_L for all p G CONS. Then x is enumerated into E n , so in the next 
step a new string must enter CONS such that condition (2.3) is satisfied for x. 

(b) U s (p,x) t^_L for all p G CONS. Hence, U s (p,x) = and, since XA Si+2 (x) — 1, P 
is removed from CONS if the next step terminates. | 

By a similar proof, one can show that every btt-complete set has hard instances 
w.r.t. ic. We have noticed in the remark following Theorem |3.3| that this is no longer 
true for d-complete sets. But we can show that it still holds for Q-complete sets. 
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Recall that A is Q-complete if it is r.e. and there is a recursive function g such 
that for all x: 

x G K W g ( x ) C A. 



See [10, p. 281 f.j for more information on Q-reducibility. 



Theorem 4.4 Every Q-complete set A has hard instances, even w.r.t. ic. 

Proof: Suppose that A is Q-complete. As in the previous proof we enumerate an 
auxiliary r.e. set B and an r.e. sequence of finite sets {E n } nE j^f such that \E n \ < 2 n . 
It suffices to get infinitely many n such that there is y G E n with ic(y : A) > n — 2. 

By the recursion theorem and the Q-completeness of A, we may assume that we 
are given in advance a recursive function g such that B <q A via g, i.e., for all x, 
x e B ^ Wg( x ) C A. 

The first idea is to run a version of the previous construction: We keep a number 
x out of B and find y G W g ( x ) which has not yet been enumerated into A. Then we 
enumerate y into E n and wait until some A-consistent program p with l(p) < n — 2 
shows up and U(p,y) = 0. Then we enumerate x into B, which forces y into A and 
diagonalizes p. 

However, this approach does not work, because it might happen that after we 
enumerate y into E n , y is also enumerated into A, and after that U (p, y) = 1 is defined. 
Then we cannot diagonalize p by enumerating x into B, but we have incremented \E n \. 
Since this can happen an arbitrary finite number of times, we run into conflict with 
the requirement \E n \ < 2 n . 

Therefore, we use the following modification: For each n, if E n ^ then we 
enumerate y into E n only if y has been previously enumerated into E n+ \, and then 
we proceed according to the first idea. If later y is enumerated into A we get a 
diagonalization for n + l instead of n, which is also fine. 

Now we turn to the formal details: Let I n = {p : l(p) < n — 2}. p G {0, 1}* 
is called A-consistent at stage s + 1 if, for all z < s, either U s (p,z) is undefined or 
U s (p,z) = Xa s (z). 

We maintain the following invariant for all n, s, y: 

If E n 7^ at stage s + 1 then enumerate y into E n only if P(n, s, y) holds, where: 

P(n, s,y) <^>)/G E n+ i — A s , E n C A s , and there is p G I n+ i which 
is A-consistent at stage s + 1 and U s (p,y) = 0. 

As a consequence of this invariant it already follows that \E n \ < 2 n : Suppose that 
E n ^ and we enumerate y into E n at stage s + 1. Then we enumerate the next 
element into E n only after y has been enumerated into A, and hence the program 
p G I n+ i which had witnessed the condition P(n, s, y) is diagonalized and can never 
be A-consistent again. Since |/ n +i| < 2 n_1 , it follows that we will enumerate at most 
1 + 2 n ~ l programs into E n . In particular, \E n \ <2 n for all n. 

We say that n is saturated at stage s + 1 if , for every y G E n , there is p G I n 
such that p is A-consistent at stage s + 1 and U s (p,y) = XA s (y)- The goal of the 
construction is to produce infinitely many n which are almost always not saturated. 
This implies at once that there are infinitely many y G E n with ic(y : A) > n — 2, 
and we are done. 
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To achieve this goal we construct a sequence d < d\ < d 2 < ■ ■ ■ and satisfy the 
following requirements 

(Ri) The interval [di, rfj+i) contains an n which is almost always not saturated. 

The diS are constructed by recursive approximation: The value of di may change 
finitely often and eventually stabilizes. Some additional variables are needed for 
book-keeping: For each % there is a finite set Tj containing the set of all x which 
may be enumerated into B for the sake of (Ri). For each n we have three variables 
active(n),cand(n), source(n). active(n) is a Boolean flag which indicates if there 
is some y G E n — A s to be enumerated into E n _\\ in this case cand(n) = y and 
source(n) = x such that x £ B s and y G W g ^ ;S . 

We say that % requires attention at stage s + 1 if one of the following conditions 
holds at the beginning of of stage s + 1. 

(1) di+i is undefined. 

(2) di+i is defined and every n G [di, dj+i) is saturated at stage s + 1. 
Construction: 

Stage 0: Let do = 0, d i+ i =f, Tj = for all «. Let active(n) = 0, _E n = for all n. 
S'tage s + 1: For every n such that active(n) = 1 and cand(n) G A s let active(n) = 0. 
Let i be the least number which requires attention at stage s + 1. If it requires 
attention through (1) then let di + i = s + 1. 

If it requires attention through (2) then we distinguish two cases: 

(a) If there is a least n G (di,d i+ i) such that active(n) = 1 and E n _\ C A s , then 
enumerate cand(n) into E n -\ and let active(n) =0. If n — 1 = di then enumerate 
source(n) into 5, else let active(n—l) = 1, cand(n— 1) = cand(n), and source(n— 1) = 
so-urce(n). 

(b) Otherwise put s+1 into Tj and let x = min(Tj — B s ). Find the least s' such that 
W^(a;),s' — A s 7^ and let y = min(W g ( a; ) iS / — A s ). Let active(s + 1) = 1, cand(s + 1) = 
y, source(s + 1) — x, and enumerate y into E s+ \. 

In both cases let T { = T { U Uj>i ^- and let 7} = 0, dj =|, for all j > i. 
End of Construction. 

It easily follows by induction on s that our invariant is satisfied: Note that before we 
enumerate a new number into E n _i via step (a), we require that E n _i C A s . If we 
enumerate a number via step (b) then the corresponding set was previously empty. 
Therefore, at each stage s + 1 every E n contains at most one number which is not in 
A s . Now suppose that E n ^\ ^ at the end of stage s and we enumerate a number 
y into E n _i at stage s+1. Then case (a) occurred and y = cand(n) G" A s (since 
active(n) = 1). By the previous remarks, we have E n _i C A s . Since n is saturated 
at stage s + 1, there is an A-consistent p G I n such that U s (p, y) = Xa s (u) = 0. Thus, 
P(n — 1, s, y) holds. 

Hence, it only remains to verify that requirement (Ri) is satisfied for all i. This is 
done by induction on i. By induction hypothesis, there is a least stage s such that 
di = s is defined at stage s and no i' < i requires attention at any stage s > s . 
At the end of stage s we have E d . = and T, = 0. We have shown above that the 
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cardinality of is always bounded by 2 di . Hence, there exists s\ > so such that 
does not change after stage s±. Note that Ed. C A, because each time when we 
enumerate y into E^, we enumerate some x into B such that x G B W s (a.) C A 
and ?/ G W^); thus we force y into A. So we can choose Si large enough such that 
Edi Q A s for all s > s\. 

Suppose for a contradiction that i requires attention infinitely often. We will argue 
that at some stage S2 > si a new element is enumerated into E^., which contradicts 
the choice of s±. There is a first stage s + 1 > si where z requires attention through 
(2); let Xq = source(s + 1). If y = cand(s + 1) G" A then there is a stage s' > s such 
that s + 1 is the least n > di with adivein) = 1 and E n _i C A s . In the following 
stages when z requires attention, y will be enumerated into E s , E s _i, and finally 
into Edi, which gives the desired contradiction. If y G A, it might happen that y is 
enumerated into A before it arrives in Ed t . But then a new candidate y' from W g < xo ) is 
chosen and a new attempt is started to bring y' into ■ Again, it might happen that 
y' is enumerated into A before it arrives in E^. However, this process cannot repeat 
infinitely often, because otherwise x G" B and hence there is some y G W g ( Xo ) — A. 
This y would in some iteration be chosen as a candidate which cannot be enumerated 
into A. So, at some stage S2 + 1 > si some y is enumerated into E^. Since y G" A s , 2 
and Edi — A S2 , this implies that Ed t increases, a contradiction. 

Thus, i requires attention only finitely often and (RA is satisfied. This completes 
the proof of the inductive step. | 

Recall that A is strongly effectively simple if it is a coinfinite r.e. set and there is 
a total recursive function / such that for all e, 

W e CA^ max(W/ e ) < /(e). 



Since every strongly effectively simple set is Q-complete HlO| , Exercise III. 6. 21, a)] we 
get the following corollary. 

Corollary 4.5 Every strongly effectively simple set has hard instances, even w.r.t. 
ic. 



It is known that hyperhypersimple sets are not Q-complete |10|, Theorem III. 4. 10], 
but we can still show that they have hard instances. 

Theorem 4.6 Every hyperhypersimple set has hard instances, even w.r.t. ic. 

Proof: The basic idea of this proof is similar to the previous one. Assume that 
A is hyperhypersimple. We enumerate an r.e. sequence of finite sets {E n } ne j^ such 
that \E n \ < 2 n . It suffices to get infinitely many n such that there is y G E n with 
Ic(y : A) > n - 2. 

Let I n = {p : l(p) < n — 2}. We initialize E n = {n} and may later enumerate 
numbers from E n into E n _\. This time we ensure that at any stage s at most two 
numbers of E n belong to A s . We never enumerate a number twice into the same set. 
Furthermore, we enumerate x into E n at stage s + 1 only if there is p G I n +i which is 
A-consistent at stage s + 1 and U s {p,x) = 0. 



Kolmogorov Complexity and Instance Complexity 



17 



From this invariant it already follows that \E n \ < 2 n : It is easy to see, by induction 
on k, that we enumerate the (2k + l)-st number into E n at stage s+1 only if there are 
at least k programs p from I n+ i which were A-consistent at some previous stage and 
are now diagonalized (i.e., for each such p there is z G E n n A s such that U s (p, z) = 0). 
Since there are less than 2 n_1 programs in I n+ \, it follows that \E n \ < 2 • 2 n ~ 1 + 1 = 
2 n + l. 

As in the previous proof, we say that n is saturated at stage s + 1 if for every 
y G E n there is p G /„ such that p is A-consistent at stage s + 1 and U s (p, y) = Xa s (u)- 
We want to produce infinitely many n which are almost always not saturated. 

To this end we construct for each e a sequence d$ < d\ < ■ ■ ■ such that for each i, 
\A fl Ed<r I > 1 or there is n G [df, which is almost always not saturated. Suppose 
we have constructed at the end of stage s an initial segment of this sequence, say 
dp < • • • < d e m+x . Let count(n,s) = \A S fl E njS \. We extend this initial segment at 
stage s + 1 only if count (df, s) > 1 for all i < m. In the end we shall be able to argue 
that if the sequence is infinite then there is a weak array which witnesses that A is 
not hyperhypersimple. Thus, the sequence must be finite, say d% < • • • < d e m ^ +1 , 
and there is n G [d e m ^, d e m ^ +l ) which is almost always not saturated. Also, since the 
strategy to extend the e-th sequence is active at only finitely many stages, we can 
build an (e + l)-st sequence with d$ +1 > d e m ^ +1 , which will also be finite and gives 
us another number that is almost always not saturated, etc. 

We assign priorities as follows: The definition of the e-th sequence has higher 
priority than the definition of the e'-th sequence if e < e'. The definition of the i- 
th member of the e-th sequence has higher priority than the definition of the i'-th 
member if i < i' . Hence, we take the lexicographical ordering <i cx on J\f x J\f as our 
priority ordering. 

For technical reasons we enumerate for each e a set M e . When we are working on 
the e-th sequence we try to establish for each d\ a number x G E^ — A. In M e we 
enumerate the current candidate for x. 

We say that (e, i) requires attention at stage s + 1 if one of the following conditions 
holds at the beginning of stage s+1. 

(1) d\ is undefined and for all j G [0,i — 1): count(dj, s) > 1 and every n G [dp d? +1 ) 
is saturated at stage s + 1. 

(2) df, d e i+1 are both defined, count(df, s) = 0, and every n G [df, d e i+1 ) is saturated 
at stage s + 1. 

Construction: 

Stage 0: Let d\ —\ and M e = for all e, i, and let E n = {n} for all n. 

Stage s + 1: Choose the lexicographically least (e,i) which requires attention at stage 

s + 1. 

If it requires attention through (1) then let d\ — s + 1, enumerate s + 1 into M e , 
and let d e - =| for all (e',j) >\ cx (e,i). 

If it requires attention through (2) and there is a least n G (cif,(if +1 ), such that 
count(n — 1, s) < 1 and there is a least x G E n>s — (A s U M C)S U E n -i jS ), then enumerate 
x into E n -i. If in addition n — \ — d\ then enumerate x into M e . In any case, let 



Kolmogorov Complexity and Instance Complexity 



18 



d e . =1 for all (e'J) >i cx (e,i). 
End 0/ Construction. 

It easily follows by induction on s that count(n, s) < 2 for all n, s, in particular, 
l-E 1 fl A\ < 2. Also, we enumerate at stage s + 1 a number x from E n into -E n -i only 
if it does not yet belong to E n _i fl A and n is saturated. In particular, there is a 
program p G J n+ i which is A-consistent at stage s + 1 and U s (p,x) = 0. 

Claim: For every e, there are only finitely many stages where (e, i) requires attention 
for some i. 

Proof: Suppose for a contradiction that there exists a least e and infinitely many s 
such that (e,i) requires attention at stage s + 1 for some Then we argue that A 
is not hyperhypersimple. First, there is a least stage so > 1 such that no (e',i') with 
e' < e requires attention at any stage s > so- Then we define d$ = s at stage s and 
we enumerate Sq into M e . By the choice of Sq, the value of d$ has stabilized. Note 
that all numbers which have been previously enumerated into M e are less than sq and 
so they do not matter for the following. By induction on s > Sq, it follows that E UyS 
contains at most one number from M CyS — A s for all n > d$. 
Now we distinguish two cases: 

(a) If there is a least i such that (e, 1) requires attention infinitely often then there is 
a stage s\ > sq where all d^ with j < % have stabilized. Thus, (e, i) infinitely often 
requires attention through (2) and d\ +1 tends to infinity. But then it follows similarly 
as in the previous proof that unboundedly many numbers are eventually enumerated 
into Ea? which contradicts the fact that the cardinality of E^? is bounded: 

If {e,i) requires attention through (2) at any stage s > si then count(d^,s) = 0, 
thus an (e,j) with j > i cannot require attention through (2) at any later stage 
s' > s, until a new number is enumerated into E d ? and count(df, s') = 1. During 
that time M e does not change. This guarantees that eventually a new number is 
enumerated into E d e, since there exist numbers z E (Un>d e E njS ) - (AUM e U E d e yS ). 
Since \E HyS fl (M £yS — A s )\ < 1 for n > d^, it causes no problems to maintain the 
constraint that a number x is enumerated from E n into E n _i at stage s + 1, only if 
x <£ (M £)S UA S ). 

(b) If for every % there are only finitely many stages (but at least one stage) where 
(e, %) requires attention, then it follows that the values d\ stabilize and form an infinite 
increasing sequence. Let d\ denote the final value. Since the sequence is infinite it 
follows that lim s count[d\ , s) > 1, thus \E d ? n A\ > 1. From the actual construction 
we get \E d e n A\ = 1 and E d e n A C M e . 

Uniformly in i we enumerate an r.e. set Ui as follows: If there is a stage s + 1 > s 
where (e, i) is the least pair which requires attention through (2) and a number x is 
enumerated into E d ?, then enumerate x into Ui. 

Since each such x is also enumerated into M e and is therefore blocked for the 
other sets, it follows that the C/j's are pairwise disjoint. By the remarks above, each 
Ui intersects A. Thus, A is not hyperhypersimple. This contradiction completes the 
proof of the claim. □ 

Thus, for each e there exists a maximal m(e) > such that the value of d e m ^ +1 
stabilizes and no (e, j) with j > m(e) + 1 requires attention at any sufficiently large 
stage. This means that there exists n e [^(e)> ^m(e)+i) wri ich is almost always not 
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saturated. Thus, there is y G E n with ic(y : A) > n — 2. Clearly, we get infinitely 
many pairwise different such y's. This completes the proof. | 

The previous result does not hold for hypersimple sets, since one can construct a 
hypersimple set that does not have hard instances. This can be done, e.g., by a direct 
modification of the proof of the next theorem. 

Recall that A is effectively simple if it is a coinfinite r.e. set and there is a recursive 
function / such that for all e, 

W e CA^ \W e \ < /(e). 

It is known that every effectively simple set is T-complete [0, Proposition III. 2. 18]. 

Theorem 4.7 There is an effectively simple set which does not have hard instances. 
In particular, there is a T-complete set which does not have hard instances. 



Proof sketch: The construction in the proof of Theorem [O] is not combinable 
with the requirement of making A effectively simple. Therefore, we use a modified 
version were we do not attempt to have the instance complexity as low as possible. 

In the following we outline the construction. A will be effectively simple for some 
/ to be determined later. As in the proof of Theorem |3.2| we are given a uniformly r.e. 
sequence {E k } k£j ^ and we build a partial recursive function ip such that for almost 
all k and for each there is some p G {0, l} k witnessing that ic^(x : A) < k. 

How do we define ip p l We will keep a list S = S k of programs of length k. The 
length of S will be fixed (depending on k). Furthermore, we have a pool P = P k of 
unused programs of length k. At the beginning |,S| + |P| = 2 k . During the construction 
some of the programs in S may become inconsistent with A, in which case they are 
removed from S and new programs from P are inserted into S. There may also exist 
a "back-up program" chosen from P. 

The programs in S will be defined at x with a 0/1- value only if x was enumerated 
into E k . The definition proceeds in a round- robin fashion: The first program in S 
takes care of the first number which is enumerated into E k , the second program takes 
care of the second number, and so on. In this way we handle the first \S\ numbers. 
Ideally, we would like that again the first program takes care of the (\S\ + l)-st number, 
etc. However, this does not work, because as soon as a program was brought into 
play we have to define it for larger and larger inputs. So it might happen that all of 
our programs are already defined (with output _L) at x when x is enumerated as the 
\S\ + 1-st number at stage s. 

Thus, we are using a program q from P which is still everywhere undefined and 
define it as Xa s { z ) for all z < s, in particular this covers all numbers currently in E k . 
For all larger values we output _L q is called the current back-up. We also suspend 
defining the programs in S until new numbers x > s are enumerated into E k . Then 
we continue as above for the next \S\ such numbers. After that a new program from 
P is defined as the current back-up in a similar way as q, and so on. 

What is the advantage of that scheme? It is more robust against injuries which 
may happen when a number x with ip p (x) = is later enumerated into A. In that 
case only one p G S is destroyed. Also, only the x G E k are critical because for x G" E k 
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we have i^ p {x) =_L. If p is destroyed then we assign a new program from P as a 
substitute. 

A crucial part in this process is the definition of the new back-up q when a round 
has been completed at the beginning of stage s. Before we define if) q , we enumerate 
all x < s into A which do not belong to any E n with n < g(k): This defines the 
current A s . Here g is some fast growing function to be determined later. Then we 
define ip q (x) = Xa s {x) for all x < s, and if) q (x) =_L otherwise. 

We use the following strategy to make A effectively simple. If at the end of some 
stage s we have W e>s C A s and |W e , s | > /(e), then choose an x G W e>a which does not 
belong to any E n with n < g(e) and enumerate it into A. Note that x exists if we 
choose / large enough such that /(e) > \Eq\ + \E\\ + • — h \E g ^\. 

This completes the description of the construction. It remains to choose the pa- 
rameters such that it works. We first count how many of the ipp with l(p) = k are 
used. Then we choose \E^\ and g in such a way that the number of used programs is 
less than 2 k . 

Let m = max{n : g(n) < k}. Then for each i < m there can be many 
rounds and after each round all programs in Sk may be destroyed (and have to be 
replaced by new ones from P^). At this time it is important that after the action of 
i we immediately define the new programs that replace the former ones which have 
been destroyed. We can do this without any further enumeration of elements into A. 
There is no cascading effect which could blow up the number of injuries. Thus, at 
most |S'fc|E^ =1 [|£'j|/|S'i|] many programs in Sk are ever injured. 

How many of the back-up functions are destroyed? Note that this may happen 
each time when some i < k acts, i.e., whenever % completes a round. Thus, at most 
S^Tq 1 H-E^I/ISil] many back-up functions are destroyed. 

The number of injuries from making A effectively simple can be bounded bym+fc: 
If we act for the sake of W eiS D A ^ (which happens at most once), then a program 
from Sk can be destroyed only if e < m, and a current back-up program can be 
destroyed only if e < k. To see the latter, note that if the current q is defined at 
x ^ E U . . . U E g rk)i then ip q (x) G {1, _L} because of the additional enumeration of 
numbers into A which was performed when q was brought into play. 

Thus, we need to ensure that for almost all k: 

(+) 2 k > \s k \^r=im\/\Si[\ + s^n^i/i^n +m+k. 

Let \S k \ = [2 k /k\, g(k) = 2 fc , and E k = {x : C(x) < 3k/2}, so \E k \ < 2 3fe / 2 . 
Define the recursive function / by /(e) = |~X)f=o 2 3 */ 2 ]. The right hand side of (+) is 
bounded above by 

(27*:) (log A;) ^ + k 2 2 k/2 + \ogk + k 

which is less than 2 k for all sufficiently large k. 

With this choice of parameters we get for almost all x, C(x) > (3/2)(ic^(x : A) — 1), 
i.e., ic^{x : A) < (2/3)C(x) + 1. Thus, A does not have hard instances. | 

The previous results characterize the reducibilities < r with r G {m, btt, c, d, p, 



tt, wtt, Q, T} (cf. the figure in |10], p. 341]) such that every r-complete set has hard 
instances, for both ic and ic. In the following table we have marked the possible 
combinations. 
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btt 
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p 


tt 


wtt 


Q 


T 


ic 


X 
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X 


X 


X 


X 


X 


X 




ic 


X 


X 


X 










X 





Remark: The T-degrees of r.e. sets with hard instances do not coincide with any of 
the known degree classes. It can be shown that they form a proper subclass of the r.e. 
nonrecursive degrees and that they properly extend the array nonrecursive degrees. 

Acknowledgments: I would like to thank Lance Fortnow, Bill Gasarch, and Paul 
Vitanyi for comments on a preliminary version of this paper. 
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